AITopics | Ontologies

Aligning human-interpretable concepts with the internal representations learned by modern machine learning systems remains a central challenge for interpretable AI. We introduce a geometric framework for comparing supervised human concepts with unsupervised intermediate representations extracted from foundation model embeddings. Motivated by the role of conceptual leaps in scientific discovery, we formalise the notion of concept frustration: a contradiction that arises when an unobserved concept induces relationships between known concepts that cannot be made consistent within an existing ontology. We develop task-aligned similarity measures that detect concept frustration between supervised concept-based models and unsupervised representations derived from foundation models, and show that the phenomenon is detectable in task-aligned geometry while conventional Euclidean comparisons fail. Under a linear-Gaussian generative model we derive a closed-form expression for Bayes-optimal concept-based classifier accuracy, decomposing predictive signal into known-known, known-unknown and unknown-unknown contributions and identifying analytically where frustration affects performance. Experiments on synthetic data and real language and vision tasks demonstrate that frustration can be detected in foundation model representations and that incorporating a frustrating concept into an interpretable model reorganises the geometry of learned concept representations, to better align human and machine reasoning. These results suggest a principled framework for diagnosing incomplete concept ontologies and aligning human and machine conceptual reasoning, with implications for the development and validation of safe interpretable AI for high-risk applications.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2603.29654

Country:

Europe > United Kingdom (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Information Technology > Security & Privacy (0.67)
Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.54)
(2 more...)

Add feedback

SM3-Text-to-Query: Synthetic Multi-Model Medical Text-to-Query Benchmark

Neural Information Processing SystemsMar-21-2026, 20:47:02 GMT

Electronic health records (EHRs) are stored in various database systems with different database models on heterogeneous storage architectures, such as relational databases, document stores, or graph databases. These different database models have a big impact on query complexity and performance. While this has been a known fact in database research, its implications for the growing number of Text-to-Query systems have surprisingly not been investigated so far.In this paper, we present SM3-Text-to-Query, the first multi-model medical Text-to-Query benchmark based on synthetic patient data from Synthea, following the SNOMED-CT taxonomy---a widely used knowledge graph ontology covering medical terminology. SM3-Text-to-Query provides data representations for relational databases (PostgreSQL), document stores (MongoDB), and graph databases (Neo4j and GraphDB (RDF)), allowing the evaluation across four popular query languages, namely SQL, MQL, Cypher, and SPARQL.We systematically and manually develop 408 template questions, which we augment to construct a benchmark of 10K diverse natural language question/query pairs for these four query languages (40K pairs overall). On our dataset, we evaluate several common in-context-learning (ICL) approaches for a set of representative closed and open-source LLMs.Our evaluation sheds light on the trade-offs between database models and query languages for different ICL strategies and LLMs. Last,SM3-Text-to-Query is easily extendable to additional query languages or real, standard-based patient databases.

artificial intelligence, large language model, natural language, (12 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Health Care Technology > Medical Record (0.59)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.59)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.50)

Add feedback

End-to-End Ontology Learning with Large Language Models

Neural Information Processing SystemsMar-21-2026, 19:41:28 GMT

Ontologies are useful for automatic machine processing of domain knowledge as they represent it in a structured format. Yet, constructing ontologies requires substantial manual effort. To automate part of this process, large language models (LLMs) have been applied to solve various subtasks of ontology learning. However, this partial ontology learning does not capture the interactions between subtasks. We address this gap by introducing OLLM, a general and scalable method for building the taxonomic backbone of an ontology from scratch.

artificial intelligence, machine learning, natural language, (6 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

d1ebc73cd4c88866b97a6851ece739d1-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 06:21:33 GMT

knowledge management, large language model, machine learning, (21 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area (0.93)
Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
Health & Medicine > Diagnostic Medicine > Imaging (0.46)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(5 more...)

Add feedback

SM3-Text-to-Query: Synthetic M ulti-M odel Medical Text-to-Query Benchmark

Neural Information Processing SystemsFeb-17-2026, 01:58:48 GMT

Text-to-Query systems have surprisingly not been investigated so far.

information retrieval, large language model, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.93)
Europe > Switzerland > Zürich > Zürich (0.04)
Europe > Czechia > Prague (0.04)
(5 more...)

Genre: Research Report (0.67)

Industry:

Health & Medicine > Health Care Providers & Services (0.93)
Government > Regional Government > North America Government > United States Government (0.67)
Information Technology > Security & Privacy (0.67)
Health & Medicine > Health Care Technology > Medical Record (0.46)

Technology:

Information Technology > Databases (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
(4 more...)

Add feedback

End-to-End Ontology Learning with Large Language Models

Neural Information Processing SystemsFeb-17-2026, 00:39:00 GMT

Ontologies are useful for automatic machine processing of domain knowledge as they represent it in a structured format.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Europe > France (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain (0.04)
(3 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry:

Health & Medicine (0.46)
Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

838694e9ab6b0a193b84daaafcac0eed-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-15-2026, 14:49:22 GMT

artificial intelligence, cloud computing, metadata, (17 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.15)
Europe > Switzerland > Vaud > Lausanne (0.04)

Genre: Workflow (0.49)

Industry: Information Technology > Security & Privacy (0.47)

Technology:

Information Technology > Cloud Computing (0.69)
Information Technology > Information Management (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.32)

Add feedback

680da2fd0331deecc2e5b7cf0e55e832-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 13:22:50 GMT

information, large language model, machine learning, (20 more...)

Neural Information Processing Systems

Country:

Oceania > Australia > New South Wales (0.04)
Asia > China > Hong Kong (0.04)
North America > United States (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Energy > Renewable (0.68)
Energy > Energy Storage (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
(2 more...)

Add feedback

DRAGON - Deep Bidirectional Language-Knowledge Graph Pretraining

Neural Information Processing SystemsFeb-12-2026, 20:01:38 GMT

ragon, reasoning, representation, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > France (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.93)

Industry:

Education (0.69)
Health & Medicine > Therapeutic Area (0.68)
Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.66)
(2 more...)

Add feedback

A Data Analysis The LoRA Dataset Project page:https: //lora-vqa.github.io/

Neural Information Processing SystemsFeb-12-2026, 15:52:14 GMT

Each question and answer group has a unique list of corresponding visuals used for image creation. The list of visible objects, which combines the correct-answer objects with an arbitrary'noise' object

artificial intelligence, logical question, natural language, (18 more...)

Neural Information Processing Systems

Industry: Education > Health & Safety > School Nutrition (0.46)

Technology: